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Abstract. We study Monge's optimal transportation problem, 
where the cost is given by optimal control cost. We prove the ex- 
istence and uniqueness of an optimal map under certain regularity 
conditions on the Lagrangian, absolute continuity of the measures 
with respect to Lebesgue, and most importantly the absence of 
sharp abnormal minimizers. In particular, this result is applicable 
in the case of subriemannian manifolds with a 2-generating distri- 
bution and cost given by cP, where d is the subriemannian distance. 
Also, wc discuss some properties of the optimal plan when abnor- 
mal minimizers are present. Finally, we consider some examples of 
displacement interpolation in the case of Grushin plane. 



Let {X, /i), {y, u) be probability spaces and c : A" x 3^ — R U {+c>o} 
be a fixed measurable function. The Monge's optimal transportation 
problem is the minimization of the following functional 



over all the Borel maps (f) : X ^ y which pushes forward /i to u: 
= v. Maps which achieve the infimum above are called optimal 
maps. In this paper, we will only consider the case when X = y = M 
is a manifold. 

In 1942, Kantorovich studied a relaxed version of the Monge's prob- 
lem in his famous paper [2]. However, a huge step toward solving the 
original problem is not achieved until a decade ago by Brenier. In [8], 
Brenier proved the existence and uniqueness of optimal map in the case 
where M = and the cost function c is given c{x, y) = |x — yp. Later, 
this is generalized to the case of a closed Riemannian manifold M with 
cost given by the square of the Riemannian distance c{x,y) = (P{x,y) 
by McCann |T7j. Recently, Bernard and Buffoni [7] generalized this 
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further to the case where the cost c is the action associated to a La- 
grangian function L : TM ^ M on a compact manifold M. More 
precisely, 

(1) c{x,y)= inf f L{xit),x{t))dt, 

where the infimum is taken over all curves joining the points x and 
y, and the Lagrangian L is fibrewise strictly convex with superlinear 
growth. 

In this paper, we consider costs similar to ([1]). However, instead of 
minimizing among all curves, the infimum is taken over a subcollection 
of curves, called admissible paths. These paths are given by a control 
system and the corresponding cost function is called the optimal control 
cost. Roughly speaking, a control system is a smooth fibrewise mapping 
of a locally trivial bundle over M into TM. Locally, such a mapping 
has a form F : (x, u) i— > F{x, u), where x G M, u & U, F{x, u) G T^-M, 
and f/ is a typical fiber. We assume that ?7 is a closed subset of a 
Euclidean space. Admissible controls are measurable bounded maps 
from [0, 1] to [/, while admissible paths are Lipschitz curves which 
satisfy the equation 

(2) x{t) = F{x{t),u{t)), 

where u{-) is an admissible control. Let L : M x U — i>]Rbea La- 
grangian, then the corresponding cost c is given by 

(3) inf f L(x(t),u(t))dt, 

where the infimum is taken over all admissible pairs (x(-), «(■)) : [0, 1] 
M X U such that x(0) = x, y{0) = y. 

In the interesting cases, the dimension of U is essentially smaller than 
that of M and, nevertheless, any two points of M can be connected by 
an optimal admissible path. In other words, the control system works 
as a nonholonomic constraint. Shortage of admissible velocities does 
not allow to recover an optimal path from its initial point and initial 
velocity and the Euler-Lagrange description of the extremals does not 
work well. On the other hand, Hamiltonian approach remains efficient 
according to the Pontryagin maximum principle. Another problem is 
appearance of so called abnormal extremals (singularities of the space 
of admissible paths) which we are obliged to fight with. 

In sections 2 and 3, we will recall some basic notions in optimal 
control theory and the theory of optimal mass transportation which 
are necessary for this paper. 
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In section 4, by using some standard argument in the theory of op- 
timal mass transportation and the Pontryagin maximum principle in 
optimal control theory, we show the existence and uniqueness of opti- 
mal map under some regularity assumptions (Theorem 14.11) . All these 
conditions are mild except the Lipschitz continuity of the cost function. 
However, this is well-known in all of the above cases mentioned. So, 
the theorem generalizes the work in [H [T71 [7] . 

In section 5, we study the Lipschitz continuity of the cost function. 
If abnormal minimizers are absent, then the cost is not only Lipschitz 
but even semi-concave (see [H]). Unfortunately, abnormal minimizers 
are unavoidable in many interesting problems and, in particular, in 
all subriemannian problems. It happens however that not all abnor- 
mal minimizers are dangerous. To keep the Lipschitz property of the 
cost, (though not the semi-concavity) it is sufficient that the, so called, 
sharp abnormal minimizers are absent. Sharp paths are essentially sin- 
gularities of the space of admissible paths whose neighborhoods in the 
second order approximations are contained in quadrics with a finite 
Morse index. Geometric control theory provides simple effective con- 
ditions of the sharpness (see, for instance, [HE])- These conditions 
allow us to prove Lipschitz continuity for a large class of optimal con- 
trol cost. Hence, proving the existence and uniqueness of optimal map 
of the corresponding Monge's problem (Theorem 16.31) . 

In section 6, we apply the above results to some subriemannian man- 
ifolds, where the cost function is given by the square of the subrieman- 
nian distance (See section 6 for the basic notions in subriemannian 
geometry). In the case of a subriemannian manifold, all the mild reg- 
ularity assumptions are satisfied. Using the result in [B] mentioned 
above (Proposition 15.21) . Lipschitz continuity of the cost can be eas- 
ily proven in the case of a step 2 distribution (Corollary 16. 2p . Hence, 
proving existence and uniqueness of optimal map. This generalizes 
the corresponding result by Ambrosio and Rigot [1] on the Heisenberg 
group. 

In section 7 and 8, we show some properties of the optimal plan when 
abnormal minimizers are present. In section 7, we consider fiows whose 
trajectories are strictly abnormal minimizers. We show that these fiows 
cannot be an optimal plan for all "nice" initial measures if the cost is 
continuous. On the contrary, in section 8, we show that these fiows are 
indeed optimal for an important class of problems with discontinuous 
cost. 

In section 9, we study two examples on Grushin plane for which the 
results in section 3 and 4 apply. 
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2. Elementary Optimal Control Theory 



In this section, we recall some notions from optimal control theory. 
See [13] for detail. Let M be a smooth manifold and let U be 
a closed subset in M™ which is called the control set. Let F : M x 
U TM be a Lipschitz continuous function such that '■= F{-,u) : 
M — TM is a smooth vector field for each point u in the control set. 
Assume that the function {x,u) i— > ^F{x,u) is continuous. Curves 
m(-) : [0, 1] — i> [/ in the control set U which are locally bounded and 
measurable are called admissible controls. 

A control system is the following ordinary differential equations with 
parameters varying over all admissible controls. 



The solutions x(t) to the above control system are called admissible 
paths and {x(t),u(t)) are called admissible pairs. 

By classical theory of ordinary differential equations, a unique solu- 
tion to the system (jlj) exists locally for almost all time t. Moreover, 
the resulting local flow is smooth in the space variable x and Lipschitz 
in the time variable t. The control system is complete if the flows of 
all control vector fields exist globally. 

Let xq and xi be two points on the manifold M. Denote by (Exq 
the set of all admissible pairs for which the corresponding 

admissible paths x{-) start at the point xq. And denote by (t^^ those 
pairs in (t^^ whose admissible paths end ai Xi. A control system is 
called controllable if the set €.^^ is always nonempty for any pair of 
points Xq and Xi on the manifold. 

Let L:Mx?7— s>]Rbea smooth function, called Lagrangian, and 
defined the cost function corresponding to this Lagrangian as follow: 



The cost function defined above is said to be complete if given any 
pairs of points {xo,xi), there exists an admissible pair which achieves 
the infimum above and the corresponding admissible path starts from 
Xq and ends at xi. 

Remark 2.1. The infimum of the problem in ([5]) can be equivalently 
characterized by taking infimum over all admissible controls u{-) such 
that the corresponding admissible paths start at the point Xi, end at 



(4) 



x{t) = F{x{t),u{t)). 



(5) c(a;o,Xi) 
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the point xq of the manifold and satisfy the following control system 

x{s) = —F{x{s),u{s)). 

This point will become important for the later discussion. 

Consider the following minimization problem, commonly known as 
the Bolza problem: 

Problem 2.2. 

inf [\{x{s),u{s))ds-f{x{l)) 

Next, we present an elementary version of the Pontryagin maximum 
principle which we prove in the appendix for the convenience of readers. 
Let vr : T*M — ^ M be the cotangent bundle projection. For each point 
u in the control set U, define the corresponding Hamiltonian function 
Hu : T*M M by 

Hu{Px) = Px{F{x, u)) + L{x, u). 

If if : T*M —s> R is a function on the cotangent bundle, we denote its 
Hamiltonian vector field by H . 



Theorem 2.3. (Pontryagin Maximum Principle for Bolza Problem) 

Let be an admissible pair which achieves the infimum 

in Problem \2.2 . Assume that the function f in Problem \2.2\ is sub- 
differentiable at the point x(l). Then, for each a in the sub-differential 
d~fx(i) of f , there exists a Lipschitz path p : [0, 1] — T*M which sat- 
isfies the following for almost all time t in the interval [0, 1].' 



' 7r(p(t)) = x(t), 
Hu(r){p{t)) = mm Hu{p{t)). 



Remark 2.4. Let A C TM be a distribution on a n-dimensional mani- 
fold M. That is, for each point x in the manifold M, it smoothly assigns 
a vector subspace A^, of the tangent space T^M. Assume that the dis- 
tribution A is trivializable, i.e. there exists a system of vector fields 
Xi, ...,Xk which span A at every point: A^^ = span{Xi(x), 
Consider the following control system: 
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k 



(7) 



x{t) = J2ui{t)X,{x{t)), 



i=l 



with initial condition x(0) = x and final condition a;(l) = y. Recall 
that we denote by (T^, the set of all admissible pair (x(-), such that 
the admissible path x(-) satisfies x{0) = x and a:(l) = y. Let c be the 
cost given by 



If k is equal to the dimension n of the manifold M , then the distribu- 
tion A is the same as the tangent bundle TM of M and the admissible 
paths of the control system ([7]) are all the paths on M. It also defines a 
Riemannian metric on M by declaring that the vector fields Xi, Xn 
are orthonormal everywhere. The cost function c is the square of the 
Riemannian distance d: c = (P. And the minimizers of this system cor- 
respond to the length minimizing geodesies on M. However, this does 
not work for distributions which are not trivializable. (For instance, 
any vector field on the 2-sphere has a zero.) 

To overcome this difficulty, we can modify the general definition of 
control system in the following way. Let V he& locally trivial bundle on 
M with bundle projection vr^ : — s> M and lei F -.V ^ TM be fibre 
preserving map. i.e. FiVx) C T^M. The control system corresponding 
to the map F is given by 



The admissible pairs are locally bounded measurable paths v{-) : [0,1] — *■ 
in such that its projection to the manifold M is a Lipschitz path: 
x(-) = TT^{v{-)) is Lipschitz. If we let V be the trivial bundle M x U, 
we get back the system (jl]). If a Lagrangian L : ^ ^ M is fixed, then 
the corresponding cost function c is defined by 



where the infimum is taken over all admissible pair v{-) : [0, 1] — 
such that the corresponding admissible path x{-) = 7r^(t>(-)) satisfies 
x(0) = X and x(l) = y. 



8 




(9) 



x{t) = F{v{t)). 



(10) 
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Let <, > be a Riemannian metric on the manifold M. If V is the 
tangent bundle TM of M, the map F is the identity map and the 
Lagrangian L : — > R given by L{v) =< v,v >, then the cost function 
c is equal to the square of the Riemannian distance. If < n, then 
the admissible paths of the control system ([7]) are paths tangent to the 
distribution A. Similar to the Riemannian case, the control system 
defines a subriemannian metric <, >'^. (See section 6 for the basics 
on subriemannian geometry) And the cost ([S]) is the square of the 
subriemannian distance ds'. c = dPg. For general distributions A which 
are not trivializable, consider the general control system ^ with V = 
A. And F : A TM is the inclusion map. If the Lagrangian L is 
defined by L{v) =< v, v , then the cost is again the square of the 
subriemannian distance. 

In this paper (except in section 8), we consider the control systems 
of the form (jlj) in order to avoid heavy notations. All the results have 
easy generalization to more general intrinsically defined systems just 
introduced. 



The theory of optimal mass transportation is about moving one mass 
to another that minimizes certain cost. More precisely, let M be a 
manifold and consider a function c : M x M — M U {+oo}, called 
the cost function. Let n and u be two Borel probability measures on 
the manifold M, then the optimal mass transportation is the following 
problem: 

Problem 3.1. Find a Borel map which achieves the following infimum 
among all Borel maps (p : M ^ M that pushes the probability measure 
/i forward to u 



Here, we recall that the push forward (p^fi is defined by (j)^fi{B) = 
fi{(j)~^{B)) for all Borel set B in M. In many cases, such a problem 
admits solution which is unique (up to measure zero), assuming abso- 
lute continuity of the measure with respect to the Lebesgue measure. 
This unique solution to (13.10 is called the optimal map or the Brenier 
map. 

The first optimal map was found by Brenier in [8] in the case where 
the manifold is M" and the cost was c{x,y) = \x — y\'^. Later, it was 
generalized to arbitrary closed, connected Riemannian manifolds in [17] 



3. Optimal Mass Transport 



inf 




8 



ANDREI AGRACHEV AND PAUL LEE 



with cost given by square of the Riemannian distance. The case for 
the Heisenberg group with the cost given by d? was done in [T], where 
d is the subriemannian distance or the gauge distance. In [7j, a much 
general cost given by the action associated to a Lagrangian function 
L : TM ^ M on a compact manifold M was considered. More precisely, 



where the infimum is taken over all curves joining the points x and y. 

Existence and uniqueness of optimal map with the cost given by ffTTl) 
was shown under the following assumptions: 

• The Lagrangian L is fibrewise strictly convex, i.e. the map 
restriction of L to the tangent space T^M is strictly convex for 
each fixed x in the manifold M. 

• L has superlinear growth, i.e. L{v)/\v\ — > as \v\ oo. 

• The cost c is complete, i.e. the infimum ([TT]) is always achieved 
by some smooth paths. 

Recently, the compactness assumption on the manifold or on the mea- 
sures was eliminated [12], [11] . 

In this paper, we consider a connected manifold M without boundary 
and the cost function c is given by ([5]). Consider the following relaxed 
version of Problem 13. Ij called Kantorovich reformulation. Let tti : 
M X M M and : M x M — > M be the projection onto the first 
and the second component respectively. Let F be the set of all joint 
measures 11 on the product manifold M x M with marginals fi and u: 
VTi^n = /i and 7r2*n = u. 

Problem 3.2. 



Remark 3.3. If is an optimal map in the problem in (13. ip . then 
{id X (j))^,fi is a joint measure in the set F. Therefore, Problem 13.21 is a 
relaxation of the problem in fl3.1l) . 

Before we proceed into the existence proof of optimal map, let us 
look at the following dual problem of Kantorovich. See [22] for history 
and the importance of this dual problem to optimal transport. 

Let c be a cost function and let / be a function on the manifold M. 
The ci-transform of the function / is the function f^^ given by 



(11) 



inf / 





r{y):= inf [c(x,y)-/(x)]. 
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The function / is a c-concave function if it satisfies f'^^'^'^ = f. Let 
^ be the set of all pairs of functions {g, h) on the manifold such that 
g : M ^RU {-00} and h : M RU {-00} are in L^{fx) and L^{u) 
respectively, and g{x) + h{y) < c{x,y) for all {x,y) G M x M. The 
dual problem of Kantorovich is the following maximization problem: 

Problem 3.4. 



The existence of solution to the above problem is well-known. See 
[22] and [23] for the proof. 

Theorem 3.5. Assume that there exists two functions ci and C2 such 
that C\ is fi-measurable, C2 is u-measurable and the cost function c 
satisfies c{x,y) < Ci{x) + C2{y) for all {x,y) in M x M. If c is also 
continuous, bounded below and C{^,v) < 00, then there exists a c- 
concave function f such that the function f is in L^{^), its ci-transform 
f^^ is in L^ity) and the pair (/, f^^) achieves the supremum in Problem 



The following theorem on the regularity of the dual pair above is 
also well-known. 

Theorem 3.6. Assume that the cost c{x, y) is continuous, bounded 
below and the measures /i and v are compactly supported. Then the 
functions f and f^^ are upper semicontinuous . If the function x t— > 
c{x, y) is also locally Lipschitz on an set U and the Lipschitz constant 
is independent of y locally, then f can be chosen to be locally Lipschitz 
on U. 

Proof. Fix e > 0. Since f{x) = infa,gA/[c(x, — /^^(y)], there exists 
y such that f{x) + e/2 > c{x,y) — f^^{y). Also, we have f{x') + 
f^^iu) ^ c{x',y) for any x' in M. So, combining the above equations 
and continuity of the cost c, we have 



for any x' close enough to x. Therefore, / is upper semicontinuous. 

Let be a compact set containing the support of the measures /i 
and ly. Let 



-00, ifxeM\K'^^^ \ -00, iixeM\K 




fix') - fix) < e 
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then the pair {g, g') achieves the maximum in Problem I3.4[ Let h = 
{g'Y^, then the pair [h, h'^^) also achieves the maximum. By definition 
of g', we have h{x) = mi[c{x,y) — f^^iy)]- By an argument the same 

as the proof of upper semicontinuity, for any x and x' in the compact 
subset K' of U, we can find y in K such that 

h{x') — h{x) < c(x, y) — c{x', y) + e/2. 

By the assumption of the cost c, the above inequality becomes 

h{x') - h{x) < kd{x,x') + e/2 

for some constant k > which is independent of x on K'. By switching 
the roles of x and x', the result follows. □ 

The following theorem about minimizers of the Problem 13.21 is well- 
known. See, for instance, |22j . 

Theorem 3.7. // we make the same assumption as in Theorem \3.5\ 
then Problem \3.2\ admits a minimizer. Moreover, the joint measure U 
in the set T achieve the infimum in Problem \3.2\ if and only if U is 
concentrated on the set 

{{x,y)eMx M\f{x) + f\y) = c(x, y)}. 

4. Existence and Uniqueness of Optimal Map 

In this section, we show that Monge's problem with cost given by 
an optimal control cost ^ can be solved under certain regularity as- 
sumptions. Let H : T*M — ^ M be the function defined by 

H{p^) = max {px{F{x, u)) - L{x, u)) . 

If H is well-defined and C^, then we denote its Hamiltonian vector 

field by H and let e*^ be its fiow. Let / be the function defined in 
Theorem 13.51 which is Lipschitz for yU-almost all x. Consider the map 

: M X [0, 1] ^ M defined by ip{x,t) = 7r(e*^(-c//^.))- 

Theorem 4.1. The map x ^ V^i(a^) := y^ix,!) is the unique (up to 
^-measure zero) optimal map to the problem liS. 1\) with cost c given by 
^ under the following assumptions: 

(1) The measures n and v are compactly supported and absolutely 
continuous with respect to the Lebesgue measure. 

(2) c is bounded below and c{x, y) is also locally Lipschitz in the x 
variable and the Lipschitz constant is independent of y locally. 
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(3) The cost c is complete, i.e. given any pairs of points (xo,xi) 
in the manifold M, there exists an admissible pair 

such that the pair achieves the infimum in u{-) is locally 
bounded measurable, x(0) = Xq and x(l) = Xi. 

(4) The Hamiltonian function H defined in [W\] is well-defined and 
C\ _^ 

(5) The Hamiltonian vector field H is complete, i.e. global flow 
exists. 

The rest of this section is devoted to the proof of Theorem 14.11 Let 
be the set of all admissible pairs such that the corresponding admissible 
paths x{-) starts from the point y: x(0) = y and satisfies the following 
control system: 



(12) x{t) = -F{x{t),u{t)). 

Let be the set of all those pairs in such that the corresponding 
admissible paths x(-) end at the point x: x(l) = x. 
First, we have the following simple observation. 

Proposition 4.2. Let x be a point which achieves the infimum f^^iy) = 
inf {c{x, y) — f{x)) and let [x, u) be an admissible pair in such that 

the corresponding admissible path x minimizes the cost given by 



c{x,y) = inf _ / L{x{t),u{t)) dt, 
(x(-),M(-))ee:5 Jo 

then {x{-),u{-)) achieves the following infimum 

(13) rn2/)= inf [\{x{s),u{s))ds-f{x{l)). 
ixi-),ui-))e€y Jo 

If x{t) = x{l — t), then x achieves the following infimum 



(14) rn2/) = , , inf ^ / Lixi.s),u{.s))ds-fixm, 

where (T^ denotes the set of all admissible pairs (x(-),m(-)) satisfying 
the following control system: 

x{t) = F{x{t),u{t)), x{l) = y. 

Let u{-) be as in the above Proposition and let u(t) = u{l — t) . Let 
Ht : T*M ^ M be given by = p,{F{x, u{t))) - L{x, u{t)). The 

following is a consequence of Theorem 12.31 
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Proposition 4.3. Let x be a curve that achieves the infimum in ^l 
and let x{t) = x{l—t). Assume that a is contained in the subdifferential 
of the function f at the point x{0), then there exists a Lipschitz curve 
p : [0, 1] — ^ T*M in the cotangent bundle such that the followings are 
true for almost all time t in the interval [0, 1]/ 

f 7l{p{t))=xit), 

Pit) = HtiPit)), 
p(0) = -a, 

Ht{p{t)) = max {p{t){F{x{t), u)) - L{x{t), u)) 



(15) 



Proof. By Theorem 12.31 there exists a curve p : [0, 1] — * T*M in the 
cotangent bundle T*M such that 



( w{p{t))=x{t), 

^^5(t)(pW) =mm(-p(t)(F(x(t),M(t))) + L(J(t),M(t))) 



where Hu{p) = min(— u{t))) + L{x, u{t))). 



Let p{t) = p{l — t) and u{t) = u{l — t), then the equations above 
become 



f 7r(p(t)) =x(t), 
p(0) = -a, 



Hi,(^tMt))=max{p{t){F{^t),u{t)))-L{x{t),u{t))). 



□ 



Assume that the Hamiltonian function H : T*M 



defined by 



(16) 



H{p^) = max {pr,{F{x, u)) - L{x, u)) 



is well-defined and Let H be the Hamiltonian vector field of the 
function H and let e*^ be its fiow. The function / defined in The- 
orem 13.51 is Lipschitz and so it is differentiable almost everywhere by 
Rademacher Theorem. Moreover, the map df : M —>■ T*M is measur- 
able and locally bounded. So, if we let : M x [0, 1] — > M be the map 

defined by (p{x,t) = 7i{e^^ {—df^)), then the map is a Borel map. 
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Proposition 4.4. Under the assumptions of Theorem \4. 1\ the follow- 
ing is true for ^-almost allx: Given a point x in the support of fi, there 
exists a unique point y such that 

f{x) + f^-{y) = c{x,y). 
Moreover, the points x and y are related by y = ip{x,l). 



Proof. We first claim that the infimum f{x) = infygA/[c(x, ?/) — f^^iy)] 
is achieved for fi almost all x. Indeed, by assumption, we have f{x) + 
f^^iy) < c{x,y) for all {x,y) in M x M. Also, let 11 be the measure 
defined in Theorem I3.7[ then f{x) + f^^iy) = c{x,y) for 11- almost 
everywhere. Since the first marginal of the measure 11 is n, the following 
is true for /i almost all x: Given a point x in the manifold M, there 
exists y in M such that f{x) + f^^iy) = c{x, y). This proves the claim. 

Fix a point x for which the infimum inf ygjv/ [c(x, y)—f'^^{y)] is achieved 
and let y be the point which achieves the infimum. By the proof of the 
above claim, x achieves the infimum f^^i^y) = inf^-g^ [c(x, y) — f{x)]. 
Therefore, by completeness of the cost c and Proposition 14.21 there ex- 
ists an admissible path x such that x{0) = x, x(l) = y and x achieves 
the infimum (fT4|) . 

Since / is Lipschitz on a bounded open set U containing the support 
of fi and z/, it is almost everywhere differentiable on U by Rademacher 
Theorem. Since /i is absolutely continuous with respect to the Lebesgue 
measure, / is also differentiable /^-almost everywhere. By Theorem 14. 3[ 
for yU-almost all x, there exists a curve p : [0, 1] T*M in the cotangent 
bundle T*M such that 

p(0) = -df^, 
' 7r(p(t)) =x(t), 

HtiPit)) = max(p(t)(F(x(t),M)) - L{x{t),u)) , 

where Ht is the function on the cotangent bundle T*M given by Ht{px) = 
PxF{x,u{t)) - L{x,u(t)). 

By the definition of H, we have H{p{t)) = Ht{p{t)). But, we also 
have H{p) > Ht{p) for all p G T*M. Since both H and Ht are C^, we 

have dH{p{t)) = dHt{p{t)). Hence, HtiPit)) = H{p{t)) for almost all 
t. The result follows from uniqueness of solution to ODE. □ 

The rest of the arguments for the existence and uniqueness of optimal 
map follow from Theorem 13.71 
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Proof of Theorem \4.1\ As mentioned above, Problem [32] is a relaxation 
of Problem [nm We can recover the later from the former by restricting 
the minimization to joint measures of the form {id x where is 

any Borel map pushing forward fi to u. Therefore, the results follow 
from Theorem 13.71 and Proposition 14. 4[ □ 



5. Regularity of Control Costs 

In Theorem 14.11 we prove existence and uniqueness of optimal maps 
under certain regularity conditions on the cost. Most of the conditions 
in the theorem are easy to verify except condition (2) and (3). In this 
section, we will give simple conditions which guarantee this regularity. 
This includes the completeness and the Lipschitz regularity of the cost. 
First, we recall some basic notions in the geometry of optimal control 
problems, see [2] and reference therein for details. 

Fix a point Xq on the manifold M and assume that the control set 
U is M'^. Denote by (txo the set of all admissible pairs such 
that the corresponding admissible paths x(-) starts at Xq. Moreover, 
we assume that the control system is of the following form: 

k 

(17) xit) = Xo(x(t)) + J2<t)M^it)), 

1=1 

where u(t) = (ui(t), ...,Uk(t)) and Xq,Xi, ...,Xk are fixed smooth vec- 
tor fields on the manifold M. The Cauchy problem for system 0171) 
is correctly stated for any locally integrable vector-function u{-). We 
assume, throughout this section, that system f|T7j) is complete, i. e. all 
solutions of the system are defined on the whole semi-axis [0,-|-oo). 
This completeness assumption is automatically satisfied if one of the 
following is true: (i) if M is a compact manifold, (ii) M is a Lie group 
and the fields Xi are left-invariant, or (iii) if M is a closed submanifold 
of the Euchdean space and |Xj(x)| < c(l + |a;|), i = 0,1, . . . k. 
Define the endpoint map End^^ : L°°([0, 1],^^=) M by 

Endx^{u{-)) = x(l), 

where (x(-),m(-)) is the admissible pair corresponding to the control 
system (ITTl) with initial condition x(0) = xq- It is known that the map 
EndxQ is a smooth mapping. The critical points of the map Endx are 
called singular controls. Admissible paths corresponding to singular 
controls are called singular paths. 
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We also need the Hessian of the mapping End^g at the critical point. 
(See [1] for detail.) Let -E be a Banach space which is an everywhere 
dense subspace of a Hilbert space H. Consider a mapping $ : — > M*^ 
such that the restriction of this map to any finite dimensional 
subspace W of the Banach space E isC^. Moreover, we assume that the 

first and second derivatives of all the restrictions $L, are continuous 

I w 

in the Hilbert space topology on the bounded subsets of E. In other 
words, 

+ w) - ^(v) = D^^{w) + ^Dl^{w) + o(|typ), w e W, 

where is a linear map and -D^$ is a quadratic mapping from E to 
M". Moreover, and -D^$|^ depend continuously on v in 

the topology of H while v is contained in a ball of E. 

The Hessian HesSt,$ : kerD^,$ cokerZ^t,^ of the function $ is the 
restriction of -D^$ to the kernel of with values considered up to 
the image of -D„$. Hessian is a part of -D^$ which survives smooth 
changes of variables in E and M". 

Let p be a covector in the dual space M"* such that pD.^^ = 0, then 
pHesSt,$ is a well-defined real quadratic form on kerD^$. We denote 
the Morse index of this quadratic form by ind(pHess„$). Recall that 
the Morse index of a quadratic form is the supremum of dimensions of 
the subspaces where the form is negative definite. 

Definition 5.1. A critical point of $ is called sharp if there exists a 
covector p such that pD^^ = and ind(pHesSt,$) < +oo. 

Needless to say, the spaces E, H and can be substituted by 
smooth manifolds (Banach, Hilbert and n-dimensional) in all this ter- 
minology. 

Going back to the control system flTTI) . let {x{-),u{-)) be an admis- 
sible pair for this system. We say that the control u{-) and the path 
x{-) are sharp if u{-) is a sharp critical point of the mapping Endxio)- 

One necessary condition for controls and paths to be sharp is the, so 
called, Goh condition. 

Proposition 5.2. (Goh condition) If p{HesSu(-)Endx(o)) < +oo, then 

p{t){XMt)))=Pit)m,X,]{xm = z,j = l,...,A;, 0<t<l, 

where p{t) = P^iP and Pt^r is the local flow of the control system (20) 
with control equal to u{-). 

See [H [6] and references therein for the proof and other effective 
necessary and sufficient conditions of the sharpness. 
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Now consider the optimal control problem 

(18) c{x,y)= M I L{x{t),u{t))dt, 

(x(.),«(-))ee:l Jo 

where the infimum ranges over all admissible pairs {x{t),u{t)) corre- 
sponding to the control system f|T7|) with initial condition x(0) = x and 
final condition x(l) = y. 

Let H : T*M —> M be the Hamiltonian function defined in f|T6l) . For 
simplicity, we assume that the Hamiltonian is C^. A minimizer x(-) of 
the above problem is called normal if there exists a curve p : [0, 1] — > 
T*M in the cotangent bundle T*M such that 7i{p{t)) = x{t) and p(-) 

is a trajectory of the Hamiltonian vector field H . Singular minimizers 
are also called abnormal. According to this not perfect terminology a 
minimizer can simultaneously be normal and abnormal. A minimizer 
which is not normal is called strictly abnormal. 

The following theorem gives simple sufficient conditions for com- 
pleteness of the cost function defined in ffTSjl . It is a combination of 
the well-known existence result (see [20]) and necessary optimality con- 
ditions (see ^). 

Theorem 5.3. ( Completeness of costs) Let L be a Lagrangian function 
which satisfies the following: 

(1) L is bounded below and there exist constants K > such that 
the ratio tends to as \u\ ^ oo uniformly on compact 
subsets of M; 

(2) for any compact C <Z M there exist constants a,b > such that 
\^{x,u)\ < a{L{x,u) + \u\) + b, Wx e C, uR\- 

(3) the function u i— > L{x,u) is a strongly convex function for all 
X e M. 

Then, for each pair of points (x, y) in the manifold M which satisfy 
c{x,y) < +00, there exists an admissible pair (x(-),m(-)) achieving the 
infimum in / flgj) . Moreover, the minimizer x{-) is either a normal or a 
sharp path. 



Remark 5.4. Theorem 15.31 gives lots of examples that satisfy condition 
(3) in Theorem 14.11 In particular, this applies to the case where the 
control set [/ = M'^ and the Lagrangian is L{x, u) = X]i=i '^1- 

Next, we proceed to the main result of of this section which concerns 
with the Lipschitz regularity of the cost function. This takes care of 
the condition (2) in Theorem 14.11 
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Theorem 5.5. (Lipschitz regularity) Assume that the system ^ does 
not admit sharp controls and the Lagrangian L satisfies conditions of 
Theorem\5^ then the set D = {{x, End^{u{-)))\x E M,u E L°°([0, 1], M'^)} 
is open in the product M x M. Moreover, the function {x, y) i— > c(x, y) 
is locally Lipschitz on the set D, where the cost c is given by 

Remark 5.6. In the case where the endpoint map is a submersion, 
there is no singular control. Therefore, Theorem 15.51 is applicable. In 
particular, this theorem, together with Theorem 14.11 and 15.31 can be 
used to treat the cases considered in [HI [13 [7]. In section 5, we will 
consider a class of examples where the endpoint map is not necessarily 
a submersion, but Theorem 15.51 is still applicable. 

The rest of the section is devoted to the proof of Theorem 15. 5[ 

Definition 5.7. Given v in the Banach space E, we write Ind^$ > m 
if 

ind(pHesSt,$) — codimimD^$ > m 
for any p in M"* \ {0} such that pD^^ = 0. 

It is easy to see that {v E E : Ind^$ > m} is an open subset of 
E for any integer m. Let 6^,(5), Bx{e) be radius e balls in E and M"" 
centered at v and x respectively. The following is a qualitative version 
of openness of a mapping $ and any mapping close to it. 

Definition 5.8. We say that the map ^ : E ^ M" is r-solid at the 
point V of the Banach space E if for some constant c > and any 
sufficiently small e > 0, there exists 6 > such that $(B „(£:)) D 
i?l,(^)(ce'') for any $ : B.y{e) — > M" such that sup \^{w)-^{w)\ < 6. 

Implicit function theorem together with Brouwer fixed point theorem 
imply that $ is 1-solid at any regular point. 

Lemma 5.9. Iflnd^^ > then $ is 2-solid at v. 

Proof. This lemma is a refinement of Theorem 20.3 from [1]. It can 
be proved by a slight modification of the proof of the cited theorem. 
Obviously, we may assume that is a critical point of $. Moreover, by 
an argument in the proof of the above cited theorem, we may assume 
that is a finite dimensional space, v = Q and $(0) = 0. 

Let E = El (B E2, where E2 = kerZ^o"^- For any w E E we write 
w = Wi + W2, where Wi E Ei, W2 E E2. Now consider the mapping 

Q:v^ Do<^Vi + 1-Dmv2), v E E. 
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It is shown in the proof of Theorem 20.3 from that Q~^{0) contains 
regular points in any neighborhood of 0. Hence 3 c > such that 
the image of any continuous mapping Q : Bo(l) sufficiently 
close (in C°-norm) to Qlg^^]^) contains Bq{c). Now we set ^^(f) = 

+ £^2); then = Q{v) + o(l) as e — >• and the desired 

property of $ is reduced to the already established property of Q. □ 

The minimization problem (fTS!) can be rephrased into a constrained 
minimization problem in an infinite-dimensional space. For simplicity, 
consider the case where M = M". Let be an admissible pair 

of the control system ([UD and let (p : x L~([0, 1],M'=) ^ M be the 
function defined by 

^{x,u{-))= [ L{x{t),u{t))dt. 
Jo 

Let $ : M" X L°°([0, 1], M^) ^ x be the map 

= {x,End^{u{-))). 

Finding the minimum in f[TB]l is now equivalent to minimizing the func- 
tion Lp on the set y). 

Due to the above discussion, we consider the following general set- 
ting. Consider a function y9 : — > M on the Banach space E such 
that is a C^-mapping for any finite dimensional subspace W of E. 
Assume that the function as well as the first and second derivatives 
of the restrictions ip\w are continuous on the bounded subsets of E in 
the topology of H . Assume that is a bounded subset of E that is 
compact in the topology of H and satisfies the following property: 

ip{v) = m.m{(p{w)\w G E, = ^{v)} 

for any v in the set K. 

We define a function fi on ^{K) by the formula fi{^{v)) = (p{v), v G 
K. 

Lemma 5.10. Iflnd^^ > 2 for any v & K , then fi is locally Lipschitz. 

Proof. Given v in K, there exists a finite dimensional subspace W of the 
Banach space E such that Ind^ ("^Ivf) — ^' Then Ind^ ("^Iv^nkerD ^p) — 
0. Hence „ is 2-sohd at v and 

for some c and any sufficiently small e. 
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Let X = and \x — y\ = ce^, then y = for some w G 

B^{e) nW n ker D^Lp. We have: 

fi{y) — < ^{w) — fi{x) = ^{w) — ip{v) < c\w — f 1^ < c'e'^. 

Moreover, the compactness of K allows to chose unique c, c' and the 
bound for e for all v & K. In particular, we can exchange x and y in 
the last inequality. Hence — < ^\y — x\. □ 

Proof of Theorem 15.51 We perform the proof only in the case M = M" 
in order to simplify the language. Generalization to any manifold is 
straightforward. We set 

E = Wx L~([0,T],M'=), = M" X L2([0,T],M^), 

^{x,u{-)) = {x,End^{u{-))), ^{x,u{-))= [ L{x{t),u{t)) dt 

Jo 

and apply the above results. 

First of all, Ind(a:,u(.))$ = lndu(-)Endx = +oo for all since 
our system does not admit sharp controls. Lemma 15.91 implies that $ 
is 2-solid and V = ^{E) is open. 

Now let i3 be a ball in E equipped with the weak topology of H. The 
endpoint mapping $ is continuous as a mapping from B to M^". Strict 
convexity of L implies that there is some constant c > such that 

ip{xn,Un{-)) - (p{x,u{-)) > c||n„,(-) - n(-)||^2 + o(l) 

as a;„ — *• x, ^ u{-) and G B. Therefore, lim ip{xn,Un{-)) > 

n— >oo 

ip{x,u{-)) and lim (p{xn,Un{-)) = ip{x,u{-)) if and only if (x^, «„(■)) 

n— »oo 

converges to in the strong topology of H. 

Assume that (p{xn,Un{-)) = yu($(a;„, for all n. Inequality 

ip{x,u{-)) < lim ip{xn,Un{-)) would imply that 

n— ►oo 

fi{^{x,u{-))) < lim 

(■)))• 

n— ►oo 

On the other hand, the openness of the map $ implies that 

> lim /i($(x„,M„(-))). 

n— »oo 

Hence lim v^(x„, Un{-)) = ip{x, u{-)) and {xn, Un{-)) converges to (x, u{-)) 

71— >00 

in the strong topology of H. 

Let C be a compact subset of V and 



20 



ANDREI AGRACHEV AND PAUL LEE 



Then K is contained in some ball B. Recall that B is equipped with 
the weak topology; it is compact. Now calculations of previous 2 para- 
graphs imply compactness of K in the strong topology of H . Finally, 
we derive the Lipschitz property of from Lemma [5. 101 □ 



6. Applications: Mass Transportation on Subriemannian 

Manifolds 

In this section, we will apply the results in the previous sections to 
some subriemannian manifolds. First, let us recall some basic defini- 
tions. 

Let A and A' be two (possibly singular) distributions on a manifold 
M . Define the distribution [A, A'] by 

[A, A']a; = span{[?;, is a section of A,w is a section of A'}. 

Define inductively the following distributions: [A, A] = A^ and A^ = 
[A,A'^~^]. A distribution A is called fc-generating if A^ = TM and 
the smallest such k is called the degree of nonholonomy. Also, the 
distribution is called bracket generating if it is fc-generating for some 
k. 

If A is a bracket generating distribution, then it defines a flag of 
distribution by 

A c A^ C ... C TM. 

The growth vector of the distribution A at the point x is defined by 
(dim Aj,., dim A^, dim Tj-M). The distribution A is called regular if 
the growth vector is the same for all x. Let x{-) : [a, 6] — > M be an 
admissible curve, that is a Lipschitz curve almost everywhere tangent 
to A. The following classical result on bracket generating distributions 
is the starting point of subriemannian geometry. 

Theorem 6.1. (Chow and Rashevskii) Given any two points x and y 
on the manifold M with a bracket generating distribution, there exists 
an admissible curve joining the two points. 

Using Chow-Rashevskii Theorem, we can define the subriemannian 
distance d. Let <, > be a fibre inner product on the distribution A, 
called subriemannian metric. The length of an admissible curve x(-) 

is defined in the usual way: length(x(-)) = £ \/< x{t),x{t) > dt. The 
subriemannian distance d{x, y) between two points x and y is defined 
by the infimum of the length of all admissible curves joining x and 
y. There is a quantitative version of Chow-Rashevskii Theorem, called 
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Ball-Box Theorem, which gives Holder continuity of the subriemannian 
distance. See [H] for detail. 

Corollary 6.2. Let ds be the metric of a complete subriemannian space 
with distribution A. Function (i| is locally Lipschitz if and only if the 
distribution is 2-generating. 

Proof. The systems with 2-generating distributions do not admit sharp 
paths because these systems are not compatible with the Goh condi- 
tion. On the other hand, constant paths (points) are sharp minimizers 
in the case of distributions whose nonholonomy degree is greater than 
2 and the ball-box theorem implies that d'^ is not locally Lipschitz at 
the diagonal in this case. □ 

Combining Corollary 16.21 with Theorem 14.11 we prove the existence 
and uniqueness of optimal map for subriemannian manifold with 2- 
generating distribution. 

Theorem 6.3. Let M be a complete subriemannian manifold defined 
by a 2-generating distribution, then there exists a unique (up to ix- 
measure zero) optimal map to the Monge's problem with the cost c 
given by c = d"^. Here ds is the subriemannian distance of M. 

Remark 6.4. The locally Lipschitz property of the distance d out of the 
diagonal is guaranteed for much bigger class of distribution. In partic- 
ular, it is proved in [3] that generic distribution of rank > 2 does not 
admit non-constant sharp trajectories. In the class of Carnot groups, 
the following estimates are valid: Generic n-dimensional Carnot group 
with rank k distribution does not admit nonconstant sharp trajectories 
ifn< {k — l)k + 1 and has nonconstant sharp length minimizing tra- 
jectories if n > (/c — + Y + -'-)• R-scall that a simply-connected Lie 
group endowed with a left-invariant distribution Vi is a Carnot group 
if the Lie algebra g is a graded nilpotent Lie algebra such that it is Lie 
generated by the block with lowest grading (i.e. g = Vi © V2 © ... © Vk, 
[Vi, Vj] = Vi+j, Vi = if i > k and Vi Lie-generates g). 

Clearly, if the cost is locally Lipschitz out of the diagonal, then the 
statement of Theorem 4.1 remains valid with the extra assumption 
that the supports of the initial fi and the final measures u are disjoint: 
supp{fi) n suppi^u) = 0. 
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7. Normal minimizers and Property of Optimal Map with 
Continuous Optimal Control Cost 

According to Theorem 15.51 it remains to study the case where sharp 
controls exist. In this section, we will prove a property of optimal 
map when the cost is continuous. Normal minimizers will play a very 
important role. 

We continue to study optimal control problem (20), (21). As we 
already mentioned, strictly abnormal minimizers must be sharp. In 
addition, if Xq = 0, then optimal control cost is continuous. According 
to the discussion at the end of the previous section, we expect strictly 
abnormal minimizers mainly for generic rank 2 distributions on the 
manifold of dimension greater than 3 and for generic Carnot groups of 
big enough corank. In these situations, strictly abnormal minimizers 
are indeed unavoidable. 

The existence of strictly abnormal minimizers for subriemannian 
manifolds is first done in [18]. In [21] and [15], it is shown that there 
are many strictly abnormal minimizers in general for subriemannian 
manifolds. (See, for instance. Theorem 17.11 below.) Finally, a general 
theory on abnormal minimizers for rank 2 distributions is developed 
in [5]. See [19] for a detail account on the history and references on 
abnormal minimizers. 

Here is a sample result in [21] which is of interest to us. 

Theorem 7.1. (Liu and Sussman) Let M he a ^-dimensional manifold 
with a rank 2 regular bracket generating distribution A and subrieman- 
nian metric <,>. Let Xi and X2 be two global sections of A such 
that 

(1) Xi and X2 are everywhere orthonormal, 

(2) Xi, X2, [Xi,X2] and [X2, [Xi,X2]] are everywhere linearly de- 
pendent, 

(3) X2, [Xi,X2] and [X2, [Xi,X2]] are everywhere linearly indepen- 
dent. 

Then any short enough segments of the integral curves of the vector 
field X2 are strictly abnormal minimizers. 

We call a local flow a strictly abnormal flow if the corresponding tra- 
jectories are all strictly abnormal minimizers. An interesting question 
is whether time-1 map of an abnormal flow is an optimal map. The 
following theorem shows that this is not the case for any reasonable 
initial measure and continuous cost. 
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Theorem 7.2. Assume that the cost c in 1^3. 1\) is continuous, bounded 
below and the support of the measure fi is equal to the closure of its 
interior. If if : M ^ M is a continuous map such that {id x (f)^fi 
achieves the infimum in Problem \3.2l then x and (p{x) are connected 
by a normal minimizer on a dense set of x in the support of fi. 

Proof. By Theorem 13.51 there exists a function / : M — M U {— cxd} 
such that / and its Ci-transform achieve the supremum in Problem 
I3.4[ Moreover, by Theorem 13.61 the functions / and f^^ are upper 
semicontinuous. By Theorem 13.71 

(19) fix) + f'^i^ix)) = c{x,^{x)) 

for /i-almost all x. By upper semi continuity of / and f^^, 

f{x) + f'^{^{x))>c{x,^{x)). 

But /(x) + f^i^y) < c{x,y) for any x,y in the manifold M. So, (fT9l) 
holds for all x in the support U of fi. Therefore, x achieves the infimum 
f'^'^{(j){x)) = mfz(zM[c{z, (f){x)) — f'^'^{z)] for all x in the support of /i. 
Moreover, using (IT^ . it is easy to see that the function / is continuous 
on U. In particular, it is subdifferentiable on a dense set of U. By 
Proposition 14.21 and Theorem 14.31 x and f{x) is connected by a normal 
minimizer if / is subdifferentiable at x. This proves the theorem. □ 



8. Optimal Maps with Abnormal Minimizers 

In this section, we describe an important class of control systems 
which admit smooth optimal maps built essentially from abnormal min- 
imizers. Recall that abnormal minimizers are singular trajectories of 
the control system whose definition does not depend on the Lagrangian. 

Let p : M — > be a smooth principal bundle where the structural 
group G is a connected Abelian Lie group. Let Xi,...,Xk be the 
vertical vector fields which generate the action of G. Consider the 
following control system 

k 

(20) x{t) = Xo{x{t)) + J2^^it)M^it)), 

1=1 

where Xq is a smooth vector field on M, and the re-scaled systems 

k 

(21) x{t) = eXoixit)) + Y,u^it)XMt)) 

1=1 
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for e > 0. 

We define the Hamiltonian H ■.T*N ^Rhj 

(22) H{p,) = maxK(t/p(Xo(y))|y G p~\x)} 

wliere px is a covector in T*N. We assume that the maximum above 
is achieved for any p in T*N and is finite. 

Typical example is the Hopf bundle (f) : SU(2) — > S and a left- 
invariant vector field Fq. Then H{p) = a\p\, where a is a constant and 
\p\ is the length of the covector p with respect to the standard (constant 
curvature) Riemannian structure on the sphere. (See [U Section 22.2]) 

Consider the following control system on with admissible pair 

G 

y{-) contained in the G-bundle p : M — > and admissible trajectory 
x(t) = p{y{t)) (See Remark [9]): 

(23) x{t) = dp{Xo{ym- 

The function H in fl22l) is the Hamiltonian of the time-optimal problem 
of the control system fl23l) . (Recall that the time optimal problem is 
the following minimization problem: Fix two points Xq and Xi in N 
and minimize the time ti among all admissible trajectories x(-) of the 
control system fl23|l such that x{to) = xq and x{ti) = Xi.) 

System fl23|) is the reduced system associated to system fl20l) ac- 
cording to the reduction procedure described in [H Chapter 22]. In 
particular, p transforms any admissible trajectory of system fl2Ul) to 
the admissible trajectory of system fl2^ . Also, the smooth extremal 
trajectories of the time-optimal problem for system (!23|) are images 
under the map p of singular trajectories of system (120|) . 

For any e > and any smooth function a : — M, we introduce 
the map 

<!>l:N-.N, $^(x) = 7r(e^^(4a)), x e N, 

where vr : T*N ^ N is the standard projection and t ^ e*^ is the 
Hamiltonian flow of H. Set 

V = {pe T*N : H{p) > 0, H is of class at p}. 

Assume that $^ pushes the measure p' forward to another measure 
u' on A^. Consider some "lifts" p and u of the measures p' and u': 
p^:P = /i',p*i^ = z/'. Let : M — > M be an optimal map pushing 
forward p to z/, then the following theorem says that is a covering of 
p o ^ = $^ o p. By the discussion above, we see that x and \E'(x) 
are connected by singular trajectories as claimed. 

Theorem 8.1. Let K he a compact subset of N and a G C'^{N). As- 
sume that da\K C V. Let p and v he Borel probability measures such 
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that supp (p*(/i)) C K. Then, for any sufficiently small £ > and any 
optimal Borel map : M ^ M of the control system l[EJ\) with any 
Lagrangian L, the following is true whenever p^^i^u) = $^^(p,,(/i)).- 



Proof. We start from the following. 

Definition 8.2. We say that a Borel map Q : K ^ N is e-admissible 
for system ( l2Ti) if there exists a Borel map ip : K ^ L°°i['^, s], G) such 
that 

Q(xo) = x(e;(/9(xo)(-)) , Vxo G A', 
where t i— *■ x (t; v?(a:o)(")) is an admissible trajectory of the reduced 
control system (123|) with initial condition x (0; (/?(xo)(-)) = ^^o- 

We are going to prove that $^ is an admissible map, unique up to 
a p^, (/x)-measure zero set, which transforms p*(/i) into p^i^u). This fact 
implies the statement of the theorem. 

Inequality H{dxa) > implies that d7r{H{dxa)) is transversal to the 
level hypersurface of a through x. Hence the map $^ is invertible on 
a neighborhood of K for any sufficiently small e. Moreover, the curve 
t I— > < t < e, is a unique admissible trajectory of system 

[23] which starts at the hypersurface a~^{a{x)) and arrives at the point 
^a{x) at time moment not greater than e. The last fact is proved by a 
simple adaptation of the standard sufficient optimality condition (see 
H Chapter 17]). 

Now we set 



then ag is a smooth function defined on a neighborhood of K. 
Optimality property of implies that 



for any e-admissible map Q and any x G K, and the inequality is strict 
at any point x where Q{y) ^ ^l{x). In particular, if 



p o ^ = $^ o p. 



a,{x) = a{{<!>iy\x))+e, 



ae{Q{x))<a,mx)) 



p,(p) {{x e K : Q{x) ^ <l>l{x)}) > 0, 



then 




Hence (5*(p*(p)) ^ p*(i^). 



□ 
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9. Example: the Grushin plane 

Grushin plane is the subriemannian space with base space and 
a singular distribution defined by the span of the following vectors 
{dxT^,xidx2} in each tangent space. In other word, the fibre of this 
distribution is the whole tangent space of if Xi 7^ and it is spanned 
by (9a;^ otherwise. We define a subriemannian metric by declaring that 
the two vector fields above are orthonormal. The control system is 
given by 

Xi = Ml, X2 = U2X1. 

The subriemannian distance d is given by d{x, y) = inf ^u{^ dt. 

In this section, we consider the optimal transport problem with cost c 
given by c = d^. 

There is no abnormal minimizer for this problem, so we consider its 
Hamiltonian H given by 

H{xi,X2,Pi,P2) = ^{pI + x\pI). 
The corresponding Hamiltonian equation is 

ii = Pi, X2 = xjp2, Pi = -xipj, P2 = 0. 

For simplicity, we consider the case xi(0) = = 0:2(0). And we let 
Pi(0) = a and ^2(0) = b. In this case, the solutions give geodesies 
emanating from a point (0, 6) on the y-axis. They are parameterized 
by (a, 6) and are given by 

2 

(24) xi{t) = y sin(6t), X2{t) = ^{2bt - sin(26t)) + S 

46^ 

if 6 7^ and given by 

(25) xi{t) = at, X2it) = 5 

if 6 = 0. A geodesic is length minimizing if and only if— vr/fe < t < Ti/h. 

Next, we consider the mass transport problem. Let d be the subrie- 
mannian distance of the Grushin plane and consider Problem I3.1l ivith 
cost c given by square of the subriemannian distance d^. We also spe- 
cialize to the case where the target measure v is equal to the delta mass 
supported at the origin. In this case, the optimal map is clearly given 
by the constant map x 1— > (0, 0). We are interested in the displacement 
interpolation corresponding to this optimal map. Recall that displace- 
ment interpolation is the one parameter family of maps 0t such that 0t 
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Figure 1. Some displacement interpolations 
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Figure 2. Graph of the function / 



is the optimal map with the cost Ct given by the following: 
Cf(x, ?/)=inf / L{x{s),u{s)) ds 



where the infimum ranges over all admissible pairs {x{-),u{-)) of the 
control system (jl]) with initial condition x{0) = x and final condition 

x{t) = y. It is easy to see that if (pi = 7i{e^ {—df)) as in Theorem 

14.11 then the displacement interpolation (pt is given by 'K{e^^ {—df)). 
Moreover, the displacement interpolation is related to the Hamilton- 
Jacobi equation via the method of characteristics. See [7] and [TU] for 
details. 

To do this, we first evaluate the equations ^IM and fl2^ at t = 1. 
Then we solve a and h in terms of Xi(l) and X2(l). If / : (— vr, vr) M 
is the function defined by f{h) = then / is invertible. A 

computation shows that 

r'(^)^-i(i) / x,(i)-j N 

"^.^(/-(^))■ y -.(IP >■ 

Therefore, the displacement interpolation is given by 

2 

ift{xuX2) = sin(6(l - t)), ^(2t6 - sin(2(l - t)h) + 5^ , 
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where a = a(xi, X2) and b = X2) are given by 



^1 



sin /-M^ ^ ^1 



10. Appendix 

This appendix is devoted to the prove of Theorem I2.3[ The first 
step is to reduce the problem into a simpler one. Recall that the Bolza 
problem is the following minimization problem: 

, . / L{x{s),u{s))ds- j{x{\y) 

(x(-),«(-))6e:.o h 

where the infimum is taken over all admissible pair (x(-), «(■)) satisfying 
the control system 

x{s) = F{x{s), u{s)) 

and initial condition x{0) = xq. 

Let X = {x, z) be a point in the product manifold M x M and consider 
the following extended control system on it: 

(26) ^ = u) := u), L{x, u)). 

Note that x(-) = {x{-), z{-)) satisfies this extended system and initial 
condition a;(0) = (xq, 0) if and only if x(-) satisfies the original control 
system in the Bolza problem with the initial condition a;(0) = Xq and 
z{t) = Jq L{q{s),u{s)) ds. Therefore, Problem 12.21 is equivalent to the 
following problem. 

Problem 10.1. 

(27) inf {z{l)-f{xm, 

where the infimum is taken over all admissible pair satisfying the ex- 
tended control system 



Problem 1 10. II is an example of the Mayer problem. Let g : N ^Whe 
a function on the manifold and the Mayer problem is the following 
minimization problem: 



Problem 10.2. 



inf ^(x(l)) 
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where the infimum is taken over all admissible pair (x(-), u{-)) satisfying 
the control system 

X = F(x, u) 
on and initial condition x(0) = xq. 

For each point u in the control set [/, define the corresponding Hamil- 
tonian function F„ : r*A^ ^ R by 

Hu{Px) =Px{F{x,u)). 



Theorem 10.3. (Pontryagin Maximum Principle for Mayer Problem) 
Let (x(-), ?!(■)) he an admissible pair which achieve the infimum in 
Problem \10.2 . Assume that the function g in Problem \10.2\ is super- 
differentiable at the point x(l) and let a be in the super- differential 
^^9^(1) 9- Then there exists a Lipschitz path p{-) : [0, 1] T*N 
which satisfies the following for almost all time t in the interval [0, 1]; 



' 7r(p(t)) = x(t), 
p{l) = a, 

m=^uit){m)._ 

Hu{t){p{t)) = mm Hu{p{t)) 



Proof. Fix a point v in the control set and a number r in the interval 
[0, 1]. For each small positive number e > 0, let be the admissible 
control defined by 

' u{t), if t ^ [r-e,r]; 
V, if t e [r — e, t]. 

Since the optimal control u is locally bounded, the new control 
defined above is also locally bounded. Let P^^ : iV — >■ iV be the time- 
dependent local fiow of the following ordinary differential equation 

T{t) = F{x{t),u,{t)). 

Here, Pq denotes the image of the point x in the manifold under 
the local fiow Pq^ at time t. It has the property that Pt^^ta ° ^ti,t2 ~ 
P^^ Also, recall that -P^^ depends smoothly on the space variable, 
Lipschitz with respect to the time variables. 

Since x(l) = Pq^(xo) and the function g is minimizing at x(l), the 
following is true for all e > 0: 



(29) 



g{Pl^ixo))>giP^4xo)). 
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Let a be a point in the super-differentiable d^g^^^-^^ at the point x(l), 
then there exists a function : — s> M such that d(j)^^^^ = a and 
g — (p has a local maximum at Combining this with fl29l) . we have 

^?(P°i(xo))-0(Po^,l(^o))< 

giPl,{xo)) - 0(Po^,i(xo)) < ^7(Po°i(^o)) - 0(^^o°i(^o)). 
Simplifying this equation, we get 

^3Q^ 0(Po-i(xo))-0(P°i(xo)) ^ ^ 

If Rt denotes the flow of the vector field Fy, then 



(31) Po,i = o o pO^_^. 

So, if we assume that r is a point of differentiability of the map 
t Pq J which is true for almost all time r in the interval [0, 1], then 
Pq-^ is different iable with respect to e at zero. Therefore, we can let e 
goes to in (l30l) and obtain 



(32) 



a 



d 
de 



e=0 



PL > 0. 



If we differentiate equation fl3T|) with respect to e and set it to zero, 
it becomes 



d_ 

Je 



e=0 



Pl, = {P'rMF^'-F^ir))oPl,. 



Substitute this equation back into (!32|) . we get the following: 



(33) ((P°i)*a)(P.(x(r)) - Pii(.)(x(r))) > 0. 

Define p : [0, 1] T*N by p(t) = (P°i)*a, then the first two asser- 
tions of the theorem are clearly satisfied. 
The following is well known (See [Ij or [T6]). 

Lemma 10.4. Let 9 = pdq be the tautological l-form on the cotangent 
bundle of the manifold N, then for each diffeomorphism P : N ^ N , 
the pull back map P* : T*N — >• T*N on the cotangent bundle of the 
manifold preserves the l-form 9. 
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Let Wt be the time-dependent vector field on the cotangent bundle 
of the manifold which satisfies 

for almost all time t in [0, 1]. If Cy denotes the Lie derivative with 
respect to a vector field V, then, by Lemma [10. 4[ the following is true 
for almost all time t in [0, 1]: 

Cwt^ = 0. 

If u; = — ^6* is the canonical symplectic 2- form on the cotangent 
bundle, then, by using Cartan's formula, we have 

zw.u; = d{9{W,)). 

Therefore, the vector field Wt is a Hamiltonian vector field with Hamil- 
tonian given by 

The third assertion of the theorem follows from this. The last assertion 
follows from fl33l) . □ 

Going back to Problem 110.11 we can apply Pontryagin Maximum 
Principle for Mayer problem. Let {x{-),z{-)) be an admissible pair 
which minimizes Problem 110.11 and let : T*M x M — >^ M be the 
function defined by 

Htip, I) = p{F{x, u{t))) + / ■ u{t)). 

By Theorem [laa there exists a curve {p{-),T{-)) : [0,1] ^ T^M x M 
such that x{t) = 7r(p(t)) and 

(34) <^ (p(l),/(l)) = (-a,l), 

Ht{vit)Ai)) = min (p{t){F{x{t),u)) +l{t) ■ L{x{t),u)) 

From the first equation in (l34l) . we get / = 0. So, /(t) = 1. Therefore, 
flM|) is simplified to 

p{l) = -a, 

H^irn. Pit)) = min (p(t)(F(J(t), u)) + L{x{t), u)) . 
This finishes the proof of Theorem 12.31 
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